Perceptive, Non-linear Speech Processing and Spiking Neural Networks

نویسندگان

  • Jean Rouat
  • Ramin Pichevar
  • Stéphane Loiselle
چکیده

Source separation and speech recognition are very difficult in the context of noisy and corrupted speech. Most conventional techniques need huge databases to estimate speech (or noise) density probabilities to perform separation or recognition. We discuss the potential of perceptive speech analysis and processing in combination with biologically plausible neural networks processors. We illustrate the potential of such non-linear processing of speech on two applications. The first is a source separation system inspired by Auditory Scene Analysis paradigm and the second is a crude spoken digit recogniser. We present preliminary results and discuss them. keywords: Auditory modelling, Source separation, Amplitude Modulation, Auditory Scene Analysis, Spiking Neurones, Temporal Correlation, Multiplicative Synapses, Cochlear Nucleus, Corrupted Speech Processing, Rank Order Coding, Speech recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neuromorphic Hardware As Database Co-Processors

Today’s databases excel at processing data using fairly simple operators but are not efficient at executing operators which include pattern matching, speech recognition or other cognitive tasks. The only way to use such operators in data processing today is to simulate spiking neural networks. Neuromorphic hardware is supposed to become ubiquitous in complementing traditional computational infr...

متن کامل

Compact hardware liquid state machines on FPGA for real-time speech recognition

Hardware implementations of Spiking Neural Networks are numerous because they are well suited for implementation in digital and analog hardware, and outperform classic neural networks. This work presents an application driven digital hardware exploration where we implement real-time, isolated digit speech recognition using a Liquid State Machine. The Liquid State Machine is a recurrent neural n...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Error-backpropagation in temporally encoded networks of spiking neurons

For a network of spiking neurons that encodes information in the timing of individual spike-times, we derive a supervised learning rule, SpikeProp, akin to traditional error-backpropagation. With this algorithm, we demonstrate how networks of spiking neurons with biologically reasonable action potentials can perform complex non-linear classification in fast temporal coding just as well as rate-...

متن کامل

A comparison between spiking and differentiable recurrent neural networks on spoken digit recognition

In this paper we demonstrate that Long Short-Term Memory (LSTM) is a differentiable recurrent neural net (RNN) capable of robustly categorizing timewarped speech data. We measure its performance on a spoken digit identification task, where the data was spike-encoded in such a way that classifying the utterances became a difficult challenge in non-linear timewarping. We find that LSTM gives grea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004